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ABSTRACT 

Video- and sound- tape recordings of 96 lessons in one 
Finnish school were analyzed using the Bellack Classification System. 
The lessons were composed of Finnish, arithmetic, civics, drawing, 
religion, and music for Grades 3 and 4. During the coding process, it 
was found that the Bellack system was not very suitable for 
describing instruction at the elementary level. Consequently, the 
system was modified. Results are presented for the following 
categories: pedagogical moves; logical, instructional, rating, and 
extralogical substantive meanings; and cycles. The results indicated 
that the teacher's role that emerged was rather similar to Bellack' s 
description. However, differences were found in the results for the 
pupil's role. (JS) 
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Investigations into the Instructional Process 
V Experiences with the Bellack Classification System 



1. Introduction 

At the University of Helsinki Institute of Education, 
a research group is engaged in the analysis of the instruc- 
tional process under the direction of Professor Matti Kos- 
kenniemi 1 ^. Since 1967 video- and sound-tape recordings of 
lessons at one elementary school class grades 3 and 4 have 
been made for the purpose. The tapes have been analyzed em- 
ploying the Bales, Bellack and Flanders classification sys- 
tems. The material dealt with in this paper was recorded 
between fall 1967 and spring 1969. As the composition of 
the class remained virtually unchanged throughout this pe- 
riod of time and as there were only two teachers, this in- 
vestigation has to be regarded as a kind of case study. The 
material consisted of 96 lessons, analyzed by the Bellack 
system, the school subjects covered being Finnish, arith- 
metic, civics, drawing, religion, and music. The codings 
were carried out directly from the tapes, and thus the in- 
teraction was not written down. 

Even when coding was being practiced, it was found that 
the Bellack system was not very well suitable as su_:. for 
the description of instruction at the elementary level. In 



1 ^ The other members of the group are: 

Anna-Kaarina Falck, Pentti Holopainen, Kai Karma, 
Erkki Komulainen and Risto Skoglund. 
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several cases the categories were too many in number, it 
was difficult for the coders to perceive so slight differ- 
ences unanimously, and the frequencies in many categories 
remained very low. Therefore the system was simplified. A 

r 

brief account is given below of the system in the form it 
was applied here. 

Pedagogical moves are identical with those in the original 
syst em . 

Loginal meanings are coded into a single category. The sub- 
stantive-logical and instructional -logical categories proved 
difficult to distinguish from each other., and in most cases 
their codings were the same nonetheless. 12 EF , DED, DEC and 
INT were combined because of low frequencies and slight 
differences in meaning. The combined category was designated 
DEF. For similar reasons, categories OPN and JUS were com- 
bined into a single category, termed OPN. Thus the main cate- 
gory of logical meanings comprises four subcategories: DEF, 
FAC, XPL and OPN. 

Instructional meanings . LOG and ACT were omitted. LOG can be 
coded into other categories (DEF, LAM, STA). There was a ten- 
dency for ACT to become a category to which all utterances 
difficult of coding were assigned. A category DIS was separat 
ed from PRC to signify the maintenance of discipline. Thus 
the main category of instructional meanings included the fol- 
lowing subcategories: ASG, MAT,. PRC, PER, STA, ACV, ACC, ACP, 
ACE, DIS, LAM. ■ 

Rating and extralogical categories . Evaluative and extralo- 
gical meanings were separated from the instructional main 
category to form a group on their own. NEG and NAD were com- 
bined into one, because they were difficult to distinguish. 
AON and PON were combined with DIR, mainly because of low 
frequencies. Thus the rating and extralogical categories 
were POS, ADM, RPT , QAL, NEG, DIR, PRF . 



Substantive meanings . These were not classified. Only their 
occurrence in the utterance was coded. An utterance was con- 
sidered to involve substantive meanings when it dealt with 
matters associated with the theme of the ongoing lesson. 

2. Reliability 

Disagreement between two coders can, in principle, arise 
in two different ways: 

a) Interaction is divided into units differently. One 
coder may regard as a separate move a part of an utterance 
which the other combines with the preceding move or, which 
often happens when coding is done directly from the tape, 
the coders hear differently and omit parts of interaction 
that they consider to be meaningless. 

b) The division into units is identical, but different 
codings are given to the units. 

In determining reliability, those, parts of interaction 
that are registered only by one coder and not by both have 
been omitted. These have accounted for some 10 - 15 % of the 
total moves. If the intention is to determine the systematic 
differences between the coders, it suffices to compare the 
total markings in each category with each other. The relia- 
bility index will then be the percentage ratio of the smaller 
to the larger number. Ordinarily, reliability has been com- 
puted by comparing through cross-tabulation the codings by 
the two coders move by move. The reliability index for any 
one category will thereby be the percentage proportion of 
identical codings of all the codings in the category. 

The reliability coefficients expressed as percentages 
have the drawback that they provide a misleading picture of 
agreement between the coders in cases where there is little 
agreement. Even when there is no agreement, i.e., when the 
codings are distributed among the categories at random, per- 
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centage indexes are definitely positive. Also, for one accus- 
tomed to the use of correlative indexes it is difficult to 
obtain a correct picture of percentage indexes. 

For these reasons it is suggested that reliability be com- 
puted as phi correlation coefficients. When the codings have 
been cross-tabulated, it is possible to find out both the num- 
ber of cases where the coders’ codings are similar and the 
number of cases where they are different. These can be written 

in a 2x2 table by dividing both the similar and different 

codings evenly on their own diagonals. The halves of the diag- 
onals will then be the same (a = d and b = c) and the margi- 
nal sums will be equal (a + b = a+c = b+d = c+d). Hence, the 

fomula of the coefficient can be simplified: 

ad - be _ a 2 -b 2 _ a 2 -b 2 

/la+b)(a+c)(b+d) (c+d ) /(a + b ) 4 (a +b ) 2 

In the case of move types and logical categories we have a 
4x4 table, where the number of off-diagonal cases is three 
times that of diagonal cases if we have a chance distribution, 
i.e., zero correlation. In order that each cell of the 2x2 
table should include the same number of cases ( = zero correla- 
tion), the number of off-diagonal cases must be divided by 
three. When written in terms of the frequencies, the formula 

f = number of diagonal cases 
f = number of off-diagonal cases. 



then becomes: 





When use is made of Bellack’s original classification, we have 
a 4x4 table only for the move types, whereas the tables 
for the other main categories will be larger. For the sake of 
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uniformity, it is suggested that the coefficient 3 be used 
with all of the main categories. The reliability coefficient 
will then be the same if the percentage proportion of simi- 
lar codings is the same. Moreover, a case produced truly by 
chance is difficult to define, except for move types, since 
the rest of the categories do not exclude one another equal- 
ly clearly. 

In order to make the reliabilities for the major catego- 
ries and subcategories mutually comparable, f- has to be di- 
vided by two in computing the index for a subcategory or, 
which is often more expedient, f has to be multiplied by 
two. This is necessary in order not. to double-count the dif- 
ferent codings. 
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The reliability of the entire main category: 




.75 



Reliability of subcategory A: 

(2 x 20 ) 2 - ( j~ ) = 1600 - 44 

2 x 20 + (ir - ) 2 2171 



The consistency of any one coder has usually been rather 
good. In one case, where the interval between the codings was 
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approximately half a year, the following reliabilities (phi 
coefficients) were obtained: move types, .95; logical cate- 
gories, .92; instructional categories, 72; extralogical and 
rating categories, 78; substantive meanings, .86. 

i * ’ 

The differences between two coders often tend to become 
systematic. To avoid this, obscure points should be repeated- 
ly controlled, example codings should be collected, etc. Typ- 
ical between-coder reliability coefficients: move types, 

80 - 90; logical categories, 60 - 75; instructional catego- 
ries, 65 - 85; extralogical and rating categories, 70 - 80; 
substantive meanings, 70 - 80. 

SOL and RES are the most, clear-cut categories and are not 
in need of any supplementary definition. By contrast, REA and 
in particular, STR are obscure in many instances, mainly in 
the sense that they are confounded with each other. At the 
grade-level dealt with in this study, pupils are very spon- 
taneous, and this renders coding difficult. In cases where 
REA is associated with STA, i . e. , wheJ^jj^t is a comment on 
the preceding utterance, the interpretation is clear. Also, 
agreement is good regarding STR moves that are clearly for- 
wards directed and initiate cycles in which the next move is 
either a SOL or REA move. However, among pupils' utterances, 
in particular, there are many that are not closely connected 
with the ongoing interaction; they are neither forwards nor 
backwards directed, nor do they usually result in changes in 
the course of interaction. They include expressions of ideas 
that suddenly occur to the pupils, answers meant to be amus- 
ing and usually addressed to other pupils, pupils' talk to 
one another not related to the topic discussed, etc. The re- 
liability of STR might improve if not only utterances direct- 
ed forwards but also those that are not clearly backwards- 
directed were included in it. 

Of the logical categories, FAC and XPL are easily con- 
founded. The difference between the two is often obscure and 
there is room for various interpretations; thus, a more de- 
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tailed definition seems to be called for. 

Of the instructional categories, PRC, MAT and ASG are 
easily confounded. 

Particularly two evaluative^categories , PDS and ADM, are 
easily confounded. The information provided by them is inter- 
esting, however, and thus it would be desirable to have their 
reliability improved without combining them in one. Coding is 
sometimes made difficult by the fact that for such neutral re- 
actions as "mm" or "uhuh" - which can be . interpreted as mean- 
ing ”1 understand" or "I heard what you said" - there are not 
symbols of their own. Such reactions could perhaps be assign- 
ed to RPT, which is also otherwise rathor neutral in meaning. 
Another possibility would be to agree that they will be coded 
REA/FAC/ACC. 

3. Results 

3 . 1 . Pedagogical Moves 

STR move. Here the results differ nost clearly from Bellack's 
results. On an average, 17.5 % of all the moves in our mate- 
rial were STR moves, and almost a half of these moves (43 %) 
are accounted for by pupils. In Bellack’s material, no more 
than 12 % of STR moves are pupils’ moves. Even when the fact, 
indicated in discussing reliability, is taken into consider- 
ation that pupils' STR moves include talk not closely related 
to the course of the interaction , the tendency is clear. The 
frequency of occurrence of STR was highest i>i the civics les- 
sons: the proportion of pupils’ moves falling within this 
category being as large as 40 %. In those lessons STR is often 
connected with PER. 

SOL move. A similar tencency, though less pronounced, is per- 
ceptible in SOL moves. They account for 36 % of the total in- 
teraction, and 24 % of them are pupils’ moves (Bellack: 14%) . 
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Of the total moves by pupils, 22 % are SOL moves (Bellauk: 

11.3 %), the corresponding figure for teachers being 45 V. 

RES move . In contrast’ to SOL and STR moves, which were more 
frequent in pupils’ interaction here than in Bellack’s re- 
sults, RES moves were less frequent. Only 38 % of the total 
moves of pupils wfere RES moves (Bellack: 6^5.4 %). Of the 
total RE3 moves, pupils accounted for 80 percent (Bellack: 

88 %). 

REA move . Of the total moves, 27.4 % were REA moves. Here, 
too, the proportion accounted for by pupils was significant- 
ly larger than in Bellack’s material, or 31 % (Bellack: 19 %) 
The share of REA-P of the total moves was 1,3 % with pupils 
and 2.6 % with teachers. REA underlined moves are mainly 
teachers’ moves; here their share of all teacher’s moves was 

1.3 %. Less than 0.5 %' of the total moves were of the REA + 
type, and a large majority of these moves were teachers’ 
moves . 

3.2. Logical Categories 

The average proportions of the total moves accounted for 
by the various logical categories were as follows: DEF, 2.5 %; 
FAC, SO. 9 %; XPL, 8.4 %; and OPN, 2.6 %. No logical meanings 
•q, were involved in 25 % of the moves. The moves involving log- 
ical meanings were distributed approximately evenly between 
pupils and teachers, except XPL moves, which were more fre- 
quent with teachers (68.4 %). The present results differed 
very clearly from Bellack’s. Compared with Bellack's results, 
FAC was almost three times as high, at the expense of the 
other logical categories. It would seem that pedagogically 
meaningful is just the ratio that FAC bears to the other log- 
ica 1 categories. The explanation presenting itself first is 
that the difference between these and Bellack’s results is 
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ascribable to the grade-level. It would seem natural for 
instruction to include more "hard facts" on the elementary 
level than on the more advanced levels. It should be kept 
in mind, however, that the reliability of the XPL category, 
in particular, would be in need of improvement. 



3.3. Instructional Categories 

An average of 66 % of the total moves has involved in- 
structional meanings. The average proportions of the total 
moves accounted for by the various categories have been as 
follows: STA 14.9, ASG 15.9, MAT 6.3, PRC 5.7, DIS 1.6, 

PER 3.0, ACV 8.4, ACC 4.5, ACP 3.6, ACE 0.8, LAM 0.6. Both 
the total frequency (which exceeded the 46.9 % reported by 
Bellack) and the percentages for the various categories in- 
dicate that utterances related to the learning and instruc- 
tion process were more frequent here than in Bellack's mate- 
rial. Correspondingly, utterances related to the school sub- 
ject concerned were relatively less frequent here. The per- 
centages for ASG, MAT, PRC + DIS and ACP were definitely 
higher than with Bellack. Dn the other hand, STA and ACV, 
which generally have to do with the school subject itself, 
were less frequent here. The most natural explanation again 
seems to lie in the grade-level. It should also be taken into 
account that, in the present study, the material included 
subjects (such as drawing) in which the performance of tasks 
and the discussion concerning them give rise to such a ten- 
dency, irrespective of the grade-level. The frequent use of 
PER is obviously typical of the elementary level in particu- 
lar. STA, PRC, DIS, ACV, ACP and LAM were definitely more 
frequent with teachers than pupils, whereas the opposite was 
the case with PER and ACE. 
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3.4. Rating and Extralogical Categories 

4 - 

An average of 35 % of the total moves fell within rating 
and extralogical categories. Such categories were involved 
in almost a half of teachers’ moves, (46.3 %), the correspond- 
ing figure for pupils being 17.3 %. The average proportions 
of total moves accounted for by the various categories were 
as follows: DIR 2.6. PRF 15.3, POS 1.0, ADM 7.7, RPT 2.8, 

QAL 1.8 and NEG 2.4 %. Teachers used the following definite- 
ly more than pupils: PRF (53.5 % of total!, POS (95.8 %), 

ADM (92.3 %) , OAL (95.0 %). By contrast, compared with teach- 
ers, pupils used DIR (98.5 %) and NEG (53.9 %) more frequent- 
ly. The percentages for DIR and PRF, which were higher than 
those reported bv Bellack, seem to suggest that the need for 
advice and guidance is greater at the elementary level, but 
an effect has also certainly been contributed by the large 
number of work tasks, referred to above, and by the discus- 
sion of these tasks. The pupils' spontaneity is evidenced 
particularly clearly by NEG. Pupils are very eager to point 
out mistakes, committed by other pupils and also by teachers. 
On the other hand, teachers tend to avoid using NEG; instead, 
they try to show the incorrectness of, e.g., an answer by 
mi lder means. 



3.5. Substantive Meanings 



Less than half of the total moves (43.6 %) involve sub- 
stantive meanings. These are distributed approximately evenly 
between teachers and pupils. 



3.6. Cyc les 



Cycles have not yet been counted from the total material, 
the following information on them being based on studies made 
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of a part of it only. The cycles used here are of a formal 
type. The liveliness of lessons and spontaneity of pupils 
make it often difficult to form cycles. For example, the 
STR-coded exclamations give automatically rise to an ex- 
cessive number of STR-in i t iated cycles in the records if 
the cycles are formed afterwards, on the basis of the cod- 
ings alone and without paying attention to the content of 
the utterances and to the way they are actually intended to 
be related with one another. Also, cycles that "do not exist" 
often tend to emerge. In order for the cycles to follow the 
actual logical course of the discussion, i.e., in order to 
improve their validity, we have begun to indicate them in 
the records even in connection with the coding. Such cycles 
are also not temporal: frequently they do not consist of 
temporally consecutive utterances. The decisive factor is 
the speaker’s intention, in so far as this can be perceived 
during coding. Inadequate utterances will thereby not inter- 
rupt the cycle, nor will they be associated with cycles to 
which they are logically unrelated, but will, instead, form 
cycles on their own and often consist of a single utterance. 

Cycles of the most frequent occurrence were 13 (SOL), 

14 (SOL-RES), 1 (STR), 2 (STR-SOL), 3 (STR-REA) and 18 
( SOL-RES-REA ) . There was a lot of variation in the distribu- 
tion of the cycles, depending on school subject and teacher. 
The following average percentages were found: cycle 13: 

20-28 %, cycle 14: 10-14 %, cycle 1: 6-17 %, cycle 2: 6-9 %, 
cycle 3: 8-17 % and cycle 18: 10-17 %. The frequent occur- 
rence of STR moves increased the frequency of STR- i n it iated 
cycles. The high percentage of cycle 13 (SOL) may have been 
due to the grade-level; this cycle was often connected with 
categories PRF, OIS, MAT, PRC and ACP, i.e., with classroom 
activities not related to the school subject concerned. The 
liveliness of interaction is clearly reflected by the activ- 
ity index, which was generally 6-7 cycles/min. The cycles 
contain an average of about 2 moves. 
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4. Pedagogical Roles in the Light of the Results 
4.1. Teacher's Role 



The picture of a teacher's role that emerged here was 
rather similar to Bellack's description. The teacher is 
clearly the most active person in the class, about 60 % of 
the total moves being made by her. The teacher's main func- 
tion is soliciting: she tries to have the pupils' respond 
either verbally or non-verballv. The response is quite fre- 
quently a non-verbal one, as the number of pupils' RES moves 
only equals two-thirds of teachers' SOL moves. In some cases 
it is also necessary for the teacher to make more than one 
SOL move in order to obtain a single response. It is the 
teacher who principally employs SOL moves, about three- 
fourths of these being made by her. 

The teacher's function next in importance is reacting to 
pupils' responses. One third of teachers' moves are REA moves. 
What the teacher reacts to is usually verbal action but it 
may sometimes also be non-verbal action. As a rule, the teach- 
er reacts to an immediately preceding utterance, whereas sum- 
marizing reactions are comparatively infrequent. The teacher's 
evaluative reactions are usually mildly approving or neutral, 
whereas she endeavours to avoid clearly negative reactions. 
Reacting is mainly up to the teacher, about two-thirds of the 
REA moves being made by her. 

The longest STR moves and the STR moves with the greatest 
effect on the course of the game are characteristics of the 
teacher's role. In most cases the teacher makes use of a STR 
move with the object of elucidating the matter about which 
she is going to ask a question or of indicating that she will 
proceed to consider a new topic. Thus, the teacher's STR move 
is usually followed by a SOL move. 

The teacher responds comparatively rarely. What is con- 
cerned in these rare instances is usually a SOL move on the 
part of a pupil asking for advice. 

ERJC * i 15 
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4.2. Pupil’s Role 



As already indicated in several contexts, the pupils 
were here clearly more spontaneous than Bellack’s pupils. 

This does not mean, however, that the number of pupil’s 
moves would have been greater. The distribution of moves 
between teachers and pupils was approximately the same as 
with Bellack: about 60 % of the total moves were made by 
the teacher. 

As with Bellack, the pupil’s main function was to re- 
spond. Nevertheless, the RES move occupied not nearly as 
prominent a position. Dnly 38 % of pupils’ moves fell with- 
in this category. Pupils generally respond in one way or an- 
other to the stimuli provided by the teacher, but a major- 
ity of their responses are non-verbal, and thus they are 
not reflected in the codings. 

In clear contrast to Bellack’s case, the pupils in our 
study used the STR move notably often. Despite the fact that 
pupils’ STR moves were usually shorter than those of teach- 
ers and that the cycles started with such moves contained 
a lower average number of moves than the cycles initiated 
by teachers’ STR moves, pupils spontaneously influenced 
the course of interaction to a very great extent. Even if 
inadequate comments and remarks were disregarded, a great 
many utterances would remain that have been taken into con- 
sideration and that have a bearing on how the interaction 
continues. Usual are, for instance, accounts given by pupils 
of their own experiences that the matters dealt with during 
the lesson have made them recollect. Also, the pupils not 
infrequently express their own views and opinions even when 
the teacher does not ask them. Thus, in this material, the 
"don’t structure" rule does not apply. A pupil’s STR move is 
often followed by a REA move. 

About a fifth of pupils’ moves are SOL moves. A majority 
of the SOL moves addressed to the teacher appertain to in- 
structional matters and are often related to DIR. Questions 
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related to substantive matters are asked by the pupils par- 
ticularly when they have to perform lengthy tasks forming 
an integral whole and feel uncertain about how to continue 
("Can this be colored red?” etc.). SOL moves addressed to 
other pupils are comparatively usual, and they may he re- 
lated both to instructional and substantive matters. 

Although a majority of the evaluative reactions are ac- 
counted for by the teacher, pupils also resort to such re- 
actions notably often. Pupils react, particularly spontane- 
ously and eagerly to mistakes committed by others. Unlike 
teachers, the pupils do not generally attempt to soften 
down their negative reactions but express them quickly and 
sharply. Thus, the "don't react evaluativelv " rule does not 
apply in the present material. 
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